Novel low-band phase representation for low bit-rate speech coding
Authors
Abstract
Vector Quantization (VQ) has been used extensively in speech vocoders. Phase information is often ignored or only coarsely represented in parametric coders because phase is difficult to quantize. This paper introduces a novel distortion measure for the low-band speech signal that takes phase information into account with no increase in bit rate. The measure is used to build a segmental vocoder whose segments are pitch periods. The proposed Time-Domain Phase-Aware (TDPA) distortion measure is described and compared with an MFCC-based distortion measure, showing how the phase information represented in the TDPA model improves the inter-frame correlation of the synthesized speech. Finally, the performance of the TDPA is evaluated using the Segmental Signal-to-Noise Ratio (SNR) and Spectral Distortion (SD). Speech quality is evaluated using the recently standardized objective quality measure PESQ.
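As a rough illustration of the segmental SNR used in the evaluation, the sketch below averages per-frame SNR values in dB. It is not the paper's implementation: the function name `segmental_snr`, the fixed 160-sample frame (20 ms at an assumed 8 kHz sampling rate), and the clamping range of -10 to 35 dB are assumptions taken from common practice, whereas the vocoder described above segments speech by pitch period.

```python
import math


def segmental_snr(reference, synthesized, frame_len=160, lo_db=-10.0, hi_db=35.0):
    """Average per-frame SNR in dB between a reference and a synthesized signal.

    frame_len, lo_db and hi_db are illustrative assumptions, not values
    taken from the paper.
    """
    n_frames = min(len(reference), len(synthesized)) // frame_len
    if n_frames == 0:
        raise ValueError("signals are shorter than one frame")

    eps = 1e-12  # avoids division by zero and log10(0) on silent or perfect frames
    total = 0.0
    for i in range(n_frames):
        start = i * frame_len
        ref = reference[start:start + frame_len]
        syn = synthesized[start:start + frame_len]
        signal_energy = sum(r * r for r in ref)
        noise_energy = sum((r - s) ** 2 for r, s in zip(ref, syn))
        snr_db = 10.0 * math.log10((signal_energy + eps) / (noise_energy + eps))
        # Clamp each frame so silent or perfectly matched frames do not dominate.
        total += min(max(snr_db, lo_db), hi_db)
    return total / n_frames


if __name__ == "__main__":
    # A 100 Hz tone sampled at 8 kHz, and a copy attenuated to 90% amplitude.
    tone = [math.sin(2 * math.pi * 100 * n / 8000) for n in range(1600)]
    attenuated = [0.9 * x for x in tone]
    print(round(segmental_snr(tone, attenuated), 3))
```

Because the error is computed sample by sample in the time domain, a measure of this kind penalizes phase mismatch as well as magnitude mismatch, which is why it complements a magnitude-only measure such as spectral distortion.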
Similar resources
Phase modelling of speech excitation for low bit-rate sinusoidal transform coding
Sinusoidal transform coding (STC) techniques model speech as the sum of sine-waves whose frequencies, amplitudes and phases are specified at regular intervals. To achieve a low-bit rate representation, only the spectral envelope is encoded and the phases are regenerated according to a minimum phase assumption. In this paper, the inaccuracy of the minimum phase model is demonstrated. It is shown...
Low bit rate coding for speech and audio using mel linear predictive coding (MLPC) analysis
This paper proposes a low bit rate coding method for speech and audio using a new analysis method named MLPC (Mel-LPC analysis). In the MLPC analysis method a spectrum envelope is estimated on a mel- or Bark-frequency scale, so as to improve the spectral resolution in the low frequency band. This analysis is accomplished with about a two-fold increase in computation over the standard LPC analysis. ...
Low Rate Sinusoidal Coding of Speech Using an Improved Phase Matching Algorithm
Sassan Ahmadi and Andreas S. Spanias, Department of Electrical Engineering, Telecommunications Research Center, Arizona State University, Tempe, AZ 85287-7206, USA. A new phase model for low-bit rate sinusoidal coding of speech is presented. Short-time sinusoidal phases are approximated using a combination of linear prediction, spectral sampling, delay compensation,...
Combined speech and audio coding with bit rate and bandwidth scalability
The growing demand for streaming multimedia services over the Internet and recently also over mobile networks has initiated a great interest in coding algorithms which are able to adapt to different transmission environments and to operate under multiple constraints of bit rate, complexity, delay, robustness to bit errors and diversity of input signals. In the light of these recent developments...
Perceptually based and embedded wideband CELP coding of speech
This paper presents a novel multi-band CELP coder with the following characteristics: wideband coding (6.5 kHz), variable bit rate (VBR) coding (10-24 kbps), low delay (10 ms), embeddability, and perceptually based dynamic bit allocation. The excitation signal of the linear prediction filter is the vector sum of eight off-line pre-filtered bandpass excitation vectors. The eight excitation codeb...
Journal title:
Volume, issue:
Pages: -
Publication date: 2007